Clustering: Evolutionary Approaches

نویسندگان

  • Mihaela Elena Breaban
  • Henri Luchian
چکیده

This thesis is concerned with exploratory data analysis by means of Evolutionary Computation techniques. The central problem addressed is cluster analysis. The main challenges arisen from the unsupervised nature of this problem are investigated. Clustering is a problem lacking a formal general-accepted objective. This justifies the multitude of approaches proposed in literature. A review of the main clustering algorithms and clustering objectives is made. A new approach that takes into account both global and local distribution in data is proposed with the aim of combining the strengths of two different clustering paradigms: centroid-based approaches and density-based approaches. The use of distance metrics in cluster analysis and their impact on the solution space are discussed. The field of metric learning is reviewed. Special emphasis is placed on feature selection methods that aim at extracting a lower-dimensional manifold from data, manifold that maximizes the clustering tendency in data. A wrapper scenario based on multi-modal search evolutionary algorithms is investigated in order to identify feature subsets relevant for the clustering task. A new clustering criterion is formulated able to offer a ranking of partitions derived in feature subspaces of different cardinalities. Particular clustering problems are approached with Evolutionary Computation techniques. Community detection in social networks based on local trust metrics raise a new challenge to clustering analysis: the underlying feature space can not be transformed straightforward into a metric space. Graph clustering is formulated as a multi-objective problem in order to address important applications in VLSI design.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proposing a Novel Cost Sensitive Imbalanced Classification Method based on Hybrid of New Fuzzy Cost Assigning Approaches, Fuzzy Clustering and Evolutionary Algorithms

In this paper, a new hybrid methodology is introduced to design a cost-sensitive fuzzy rule-based classification system. A novel cost metric is proposed based on the combination of three different concepts: Entropy, Gini index and DKM criterion. In order to calculate the effective cost of patterns, a hybrid of fuzzy c-means clustering and particle swarm optimization algorithm is utilized. This ...

متن کامل

A partition-based algorithm for clustering large-scale software systems

Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawba...

متن کامل

Multi-layer Clustering Topology Design in Densely Deployed Wireless Sensor Network using Evolutionary Algorithms

Due to the resource constraint and dynamic parameters, reducing energy consumption became the most important issues of wireless sensor networks topology design. All proposed hierarchy methods cluster a WSN in different cluster layers in one step of evolutionary algorithm usage with complicated parameters which may lead to reducing efficiency and performance. In fact, in WSNs topology, increasin...

متن کامل

Improved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring

In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...

متن کامل

Evolutionary User Clustering Based on Time-Aware Interest Changes in the Recommender System

The plenty of data on the Internet has created problems for users and has caused confusion in finding the proper information. Also, users' tastes and preferences change over time. Recommender systems can help users find useful information. Due to changing interests, systems must be able to evolve. In order to solve this problem, users are clustered that determine the most desirable users, it pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011